Feeds to Scour
SubscribedAll
Scoured 18226 posts in 487.2 ms
A Comprehensive Evaluation of LLM Reasoning: From Single-Model to Multi-Agent Paradigms
arxiv.orgยท1d
๐Ÿ—๏ธLLM Infrastructure
Preview
Report Post
Without benchmarking LLMs, you're likely overpaying 5-10x
karllorey.comยท1dยท
Discuss: Hacker News
๐Ÿ—๏ธLLM Infrastructure
Preview
Report Post
Dungeons & Dragons puts top AI models to the test
semafor.comยท13h
๐Ÿ†•New AI
Preview
Report Post
LLMs Under Siege: The Red Team Reality Check of 2026
eddieoz.comยท13hยท
Discuss: Hacker News
๐Ÿ•ณLLM Vulnerabilities
Preview
Report Post
Meet the IBM researchers trying to make LLMs smarter
research.ibm.comยท20h
๐Ÿ—๏ธLLM Infrastructure
Preview
Report Post
Confident Rankings with Fewer Items: Adaptive LLM Evaluation with Continuous Scores
arxiv.orgยท1d
๐Ÿ“ŠStatistical Ranking
Preview
Report Post
Could LLM alignment research reduce x-risk if the first takeover-capable AI is not an LLM?
lesswrong.comยท2d
๐Ÿ›ก๏ธAI Safety
Preview
Report Post
Get To Grips With Transformers And LLMs
i-programmer.infoยท1dยท
๐Ÿช„Prompt Engineering
Preview
Report Post
From 75% to 99.6%: The Math of LLM Ensembles
shibaprasadb.comยท1dยท
Discuss: Hacker News
๐Ÿง LLM Inference
Preview
Report Post
Stop Asking โ€œWhatโ€™s the Best LLM?โ€โ€Šโ€”โ€ŠHereโ€™s the Right Question
pub.towardsai.netยท6d
๐Ÿ—๏ธLLM Infrastructure
Preview
Report Post
On Evaluating Cognitive Capabilities in Machines (and Other "Alien" Intelligences)
aiguide.substack.comยท20hยท
Discuss: Substack
๐Ÿ›ก๏ธAI Safety
Preview
Report Post
AI SRE roundtable: The creation of a new category
thenewstack.ioยท10h
๐Ÿ†•New AI
Preview
Report Post
ChatGPTโ€™s Laws of Machine Learning
shruggingface.comยท1d
๐Ÿ›ก๏ธAI Security
Preview
Report Post
The three types of LLM workloads and how to serve them
modal.comยท17hยท
Discuss: Hacker News
๐Ÿ—๏ธLLM Infrastructure
Preview
Report Post
Co-optimization Approaches For Reliable and Efficient AI Acceleration (Peking University et al.)
semiengineering.comยท16h
โšกHardware Acceleration
Preview
Report Post
Building a Regulatory Risk Copilot with Databricks Agent Bricks (Part 1: Information Extraction)
databricks.comยท13h
๐Ÿ—๏ธLLM Infrastructure
Preview
Report Post
Thoughts on LLMs (closed- and open-source) in software development after one year of professional use.
ubaada.comยท3hยท
Discuss: r/LocalLLaMA
๐Ÿช„Prompt Engineering
Preview
Report Post
LLM API Providers Leaderboard - Comparison of over 500 AI Model endpoints
artificialanalysis.aiยท4d
๐Ÿฆ™Ollama
Preview
Report Post
The Disequilibrium Advantage - Log
nibzard.comยท1d
๐Ÿ’ฐTokenomics
Preview
Report Post
AI Systems Performance Engineering
github.comยท8hยท
Discuss: Hacker News
๐Ÿ“…Resource Scheduling
Preview
Report Post

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help